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RAW SEQUENCE LISTING 

PATENT APPLICATION US/10/022,366 



DATE: 01/17/2002 
TIME: 06:57:43 



INPUT SET: S36723.raw 
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JChisJRaw-Listing-contains-the Genera! 

Information Section and up to the first 5 pages* 



(i) 



^General Information) 



SEQUENCE LISTING 



Does Not Comply 
Corrected Diskette Needed 



(i) APPLICANT: ^/^/^-W^^^^T t " lo*J±l* 

(A) NAME t Boehringer Mannheim GmbH J< CD\ JJ ^ JU Sh*****^ 

(B) STREE ' i! SaildUuCyi SLf. 112-132 

(C) CITY i Mannhoim 



( B) COUNTRY t Germany 
- (F) POOTAL CODE i 6 8 305 - 



(ii) TITLE OF INVENTION: Fanconi-gene II 

(iii) NUMBER OF SEQUENCES: 2 



■9 



sjL*^trt<J-4S -for <A 



Cv> COftKESFONOOMCE AXXtfESS: 
CA) AOOROSEC 
<B) STREET: 



lOCmf: 
(DI STATE: 
(D COUNTRY: 

en nr. 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS S~> N 

(D) SOFTWARE: Patentln Release #1.0, Version #1,30 MEPO) ) 

(vi) CURRENT APPLICATION DATA 



JRRENT APPLICATION DATAj^ — — 

(A) APPLICATION NUMBER { IJS/09/977 , 801 ^- 



(B) FILING DATE 



(vii) PRIOR APPLICATION DATA: 



(A) APPLICATION NUMBER: US/0 9/ 402,632 

(B) FILING DATE: 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 256 .. 924 

(ix) FEATURE: 

(A) NAME/KEY: CDS 




PAGE: 2 RAW SEQUENCE LISTING DATE- 01/17/2002 

PATENT APPLICATION US/10/022,366 ' TIME: 06:57:44 

INPUT SET: S36723.raw 

47 (B) LOCATION : 430. . 924 

48 

43- Cxi~) S EQUENCE - DE S CRTPTXOlSn SEQ - 1 D — NO - : T~: 

50 

51 TTTCACCGTC TAGAGGCATA AG AGGTGAG C CCGTGCTCTT CAGCGGAGAA GATCCCCTAC 60 
52 

53 CTGGCCGCCG GCCACTTTCT GTGGGCCGTG GGGTCCTCAA GGAGACGGCC CTTGGGCTCA 120 
54 

55 GGGGCTGCGT TTCCACACGC GCCTTTCCCA GGGCTCCCGC GCCCGTTCCT GCCTGGCCGC 180 
56 

57 CGGCCGCTCC AACAGCAGCA CAAGGCGGGA CTCAGAACCG GCGTTCAGGG CCGCCAGCGG 240 
58 

59 CCGCGAGGCC CTGAG ATG AGG CTC CAA AGA CCC CGA CAG GCC CCG GCG GGT 2 91 

60 Met Arg Leu Gin Arg Pro Arg Gin Ala Pro Ala Gly 

61 15 10 
62 

63 GGG AGG CGC GCG CCC CGG GGC GGG CGG GGC TCC CCC TAC CGG CCA GAC 33 9 

64 Gly Arg Arg Ala Pro Arg Gly Gly Arg Gly Ser Pro Tyr Arg Pro Asp 

65 15 20 25 
66 

67 

68 CCG GGG AGA GGC GCG CGG AGG CTG CGA AGG TTC CAG AAG GGC GGG GAG 3 87 

69 Pro Gly Arg Gly Ala Arg Arg Leu Arg Arg Phe Gin Lys Gly Gly Glu 

70 30 35 ~ 40 
71 

72 GGG GCG CCG CGC GCT GAC CCT CCC TGG GCA CCG CTG GGG ACG ATG GCG 43 5 

73 Gly Ala Pro Arg Ala Asp Pro Pro Trp Ala Pro Leu Gly Thr Met Ala 

74 45 50 55 60 
75 

76 CTG CTC GCC TTG CTG CTG GTC GTG GCC CTA CCG CGG GTG TGG ACA GAC 483 

77 Leu Leu Ala Leu Leu Leu Val Val Ala Leu Pro Arg Val Trp Thr Asp 

78 65 70 75 
79 

80 GCC AAC CYG ACT GCG AGA CAA CGA GAT CCA SAG GAC TCC CAG CGA ACG 531 

81 Ala Asn Xaa Thr Ala Arg Gin Arg Asp Pro Xaa Asp Ser Gin Arg Thr 

82 80 85 ~ 90 
83 

84 GAC GAG GGT GAC AAT AGA GTG TGG TGT CAT GTT TGT GAG AGA GAA AAC 579 

85 Asp Glu Gly Asp Asn Arg Val Trp Cys His Val Cys Glu Arg Glu Asn 

86 95 100 105 
87 

88 ACT TTC GAG TGC CAG AAC CCA AGG AGG TGC AAA TGG ACA GAG CCA TAC 627 

89 Thr Phe Glu Cys Gin Asn Pro Arg Arg Cys Lys Trp Thr Glu Pro Tyr 

90 110 115 120 
91 

92 TGC GTT ATA GCG GCC GTG AAA ATA TTT CCA CGT TTT TTC ATG GTT GCG 675 

93 Cys Val He Ala Ala Val Lys He Phe Pro Arg Phe Phe Met Val Ala 

94 125 130 135 140 
95 

96 AAG CAG TGC TCC GCT GGT TGT GCA GCG ATG GAG AGA CCC AAG CCA GAG 723 

97 Lys Gin Cys Ser Ala Gly Cys Ala Ala Met Glu Arg Pro Lys Pro Glu 

98 145 150 155 
99 



PAGE: 3 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/10/022,366 



DATE: 01/17/2002 
TIME: 06:57:44 



100 


GAG AAG 


CGG 


TTT 


CTC 


CTG 


GAA 


GAG 


CCC 


ATG 


CCC 


TTC TTT TAC 


CTC 


AAG 


101 


Glu Lys 


Arg 


Phe 


Leu 


Leu 


Glu 


Glu 


Pro 


Met 


Pro 


Phe Phe Tyr 

170 


Leu 


Lys 


1.0.2 
103 






160- 










-1-6-5- 












104 


TGT TGT 


AAA 


ATT 


CGC 


TAC 


TGC 


AAT 


TTA 


GAG 


GGG 


CCA CCT ATC 


AAC 


TCA 


105 


Cys Cys 


Lys 


lie 


Arg 


Tyr 


Cys 


Asn 


Leu 


Glu 


Gly 


Pro Pro He 


Asn 


Ser 


106 




175 










180 








185 






107 




























108 


TCA GTG 


TTC 


AAA 


GAA 


TAT 


GCT 


GGG 


AGC 


ATG 


GGT 


GAG AGC TGT 


GGT 


GGG 


109 


Ser Val 


Phe 


Lys 


Glu 


Tyr 


Ala 


Gly Ser 


Met 


Gly 


Glu Ser Cys 


Gly 


Gly 


110 


190: 










195 










200 






111 




























112 


CTG TGG 


CTG 


GCC 


ATC 


CTC 


CTG 


CTG 


CTG 


GCC 


TCC 


ATT GCA GCC 


GGC 


CTC 


113 


Leu Trp 


Leu 


Ala 


He 


Leu 


Leu 


Leu 


Leu 


Ala 


Ser 


He Ala Ala 


Gly 


Leu 


114 


205 








210 










215 






220 


115 




























116 


AGC CTG 


TCT 


TGAGCCACGG GACTGCCACA GACTGAGCCT TCCGGAGCAT 






117 


Ser Leu 


Ser 
























118 




























119 




























120 


GGACTCGCTC CAGACCGTTG TCACCTGTTG CATTAAACTT 


GTTTTCTGTT GAAAAAAAi 


121 




























122 


AA 


























123 




























124 




























125 


(2) INFORMATION 


FOR 


SEQ 


ID NO: 2: 












126 




























127 




(i) SEQUENCE 


CHARACTERISTICS : 












128 




(A) LENGTH: 223 amino 


acids 










129 




(B) TYPE: 


amino acids 














130 




(D) TOPOLOGY: 


linear 














131 




























132 




(ii) 


MOLECULE TYPE: protein 












133 




























134 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


: 2: 






135 




























136 




























137 


Met Arg 


Leu 


Gin 


Arg 


Pro 


Arg 


Gin 


Ala 


Pro 


Ala 


Gly Gly Arg Arg Ala 


138 


1 






5 










10 






15 




139 




























140 


Pro Arg 


Gly 


Gly 


Arg 


Gly 


Ser 


Pro 


Tyr 


Arg 


Pro 


Asp Pro Gly Arg 


Gly 


141 






20 










25 






30 






142 




























143 


Ala Arg 


Arg 


Leu 


Arg 


Arg 


Phe 


Gin 


Lys 


Gly 


Gly 


Glu Gly Ala 


Pro Arg 


144 




35 










40 








45 






145 




























146 


Ala Asp 


Pro 


Pro 


Trp 


Ala 


Pro 


Leu 


Gly 


Thr 


Met 


Ala Leu Leu 


Ala 


Leu 


147 


50 










55 










60 






148 




























149 


Leu Leu 


Val 


Val 


Ala 


Leu 


Pro 


Arg 


Val 


Trp 


Thr 


Asp Ala Asn Xaa Thr 


150 


65 








70 










75 






80 


151 




























152 


Ala Arg 


Gin 


Arg 


Asp 


Pro 


Xaa 


Asp 


Ser 


Gin 


Arg 


Thr Asp Glu Gly Asp 



INPUT SET: S36723.raw 
ill 



819 



867 



915 



964 



1026 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/10/022,366 



DATE: 01/17/2002 
TIME: 06:57:44 



INPUT SET: S36723.raw 

153 85 90 95 

154 



-1-55— 


— Asn— Arg-Va-1— Trp- 


"Cys" 


-Hi"s Val 


Cys 


Glu 


Arg 


Glu 


Asn 


Thr 


Phe 


Glu 


Cys 


156 






100 










105 










110 






157 
































158 


Gln 


Asn 


Pro Arg 


Arg 


Cys 


Lys 


Trp 


Thr 


Glu 


Pro 


Tyr 


Cys 


Val 


He 


Ala 


159 






115 








120 










125 








160 
































161 


Ala 


Val 


Lys He 


Phe 


Pro 


Arg 


Phe 


Phe 


Met 


Val 


Ala 


Lys 


Gin 


Cys 


Ser 


162 




13 0 








135 










140 










163 
































164 


Ala 


Gly 


Cys Ala 


Ala 


Met 


Glu 


Arg 


Pro 


Lys 


Pro 


Glu 


Glu 


Lys 


Arg 


Phe 


165 


145 








150 










155 










160 


166 
































167 


Leu 


Leu 


Glu Glu 


Pro 


Met 


Pro 


Phe 


Phe 


Tyr 


Leu 


Lys 


Cys 


Cys 


Lys 


He 


168 








165 










170 










175 




169 
































170 


Arg 


Tyr 


Cys Asn 


Leu 


Glu 


Gly 


Pro 


Pro 


He 


Asn 


Ser 


Ser 


Val 


Phe 


Lys 


171 






180 










185 










190 






172 
































173 


Glu 


Tyr 


Ala Gly 


Ser 


Met 


Gly 


Glu 


Ser 


Cys 


Gly 


Gly 


Leu 


Trp 


Leu 


Ala 


174 






195 








200 










205 








175 
































176 


He 


Leu 


Leu Leu 


Leu 


Ala 


Ser 


He 


Ala 


Ala 


Gly 


Leu 


Ser 


Leu 


Ser 




177 




210 








215 










220 










178 
































179 
































180 
































181 
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SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/10/022,366 



DATE: 01/17/2002 
TIME: 06:57:45 



INPUT SET: S36723.raw 



— — Line Error 

5 Mandatory Value Not Present 

6 Unknown or Misplaced Identifier 

7 Unknown or Misplaced Identifier 

8 Unknown or Misplaced Identifier 

9 Unknown or Misplaced Identifier 

10 Unknown or Misplaced Identifier 
23 Wrong application Serial Number 



Original Text 
(i) APPLICANT: 

(A) NAME: Boehringer Mannheim GmbH 

(B) STREET: Sandhofer Str. 112-132 

(C) CITY: Mannheim 

(E) COUNTRY: Germany 

(F) POSTAL CODE: 68305 

(A) APPLICATION NUMBER:US/09/977,801 



PAGE: 1 



DATE: 01/17/2002 
TIME: 06:57:45 



INPUT SET: S36723.raw 



ADDRESSEE 

STREET 

CITY 

STATE 

COUNTRY 

ZIP 

CORRESPONDENCE ADDRESS 
CLASSIFICATION 



